20. Clean (Define)
Clean: Define
Clean: Define
Tasks
Task Description:
In the Jupyter Notebook below, in your own words, convert the assessments we made previously into defined cleaning operations, as shown in the above video. The missing values (NaN) and untidy dataset issues will not be cleaned in this lesson, so do not write a definition for those assessments.
Task Feedback:
Awesome! Now it's time to code.
Workspace
This section contains either a workspace (it can be a Jupyter Notebook workspace or an online code editor work space, etc.) and it cannot be automatically downloaded to be generated here. Please access the classroom with your account and manually download the workspace to your local machine. Note that for some courses, Udacity upload the workspace files onto https://github.com/udacity , so you may be able to download them there.
Workspace Information:
- Default file path:
- Workspace type: jupyter
- Opened files (when workspace is loaded): n/a
Defining, then Coding and Testing Immediately
For pedagogical purposes in this lesson, we will be performing the define, code, and test steps of cleaning data programmatically in order. In other words, we write all of the definitions, then convert all of the definitions to code, then test all of the cleaning operations.
In reality, it is often more practical to define a cleaning operation, then immediately code and test it. The data wrangling template still applies here, except you'll have multiple Define , Code , and Test subheadings, with third level headers (###) denoting each issue, as displayed below.
